Unlimited Vocabulary Grapheme to Phoneme Conversion with Probabilistic Phrase Break Detection
Abstract
This paper describes a grapheme-to-phoneme conversion method that uses phoneme connectivity and CCV conversion rules together with probabilistic phrase break detection. The method consists of four main modules: phrase-break detection, morpheme normalization, morpheme-to-phoneme conversion, and a phoneme connectivity check. In experiments on a test corpus of 210 sentences, phrase break detection performance reached 85%. Grapheme-to-phoneme conversion performance on the same 210 sentences was 85.5%, and it improved to 90.8% once phrase break detection was employed. On 4,973 test sentences free of phrase breaks and non-Korean symbols, grapheme-to-phoneme conversion performance is 99.9%. A full Korean TTS system is now being implemented using these phrase break detection and grapheme-to-phoneme conversion methods.
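To put the reported gain in perspective, the following minimal Python sketch computes the relative reduction in errors implied by the move from 85.5% to 90.8% on the 210-sentence corpus. It assumes the reported performance figures can be read as percent accuracy, which the abstract does not state explicitly; the function name `relative_error_reduction` is introduced here for illustration only.

```python
def relative_error_reduction(acc_before: float, acc_after: float) -> float:
    """Fraction of the original errors removed when accuracy (in percent)
    rises from acc_before to acc_after.

    Assumption: the abstract's "performance" figures are percent accuracy.
    """
    err_before = 100.0 - acc_before  # error rate without phrase break detection
    err_after = 100.0 - acc_after    # error rate with phrase break detection
    return (err_before - err_after) / err_before


# Figures taken from the abstract: 85.5% before, 90.8% after.
reduction = relative_error_reduction(85.5, 90.8)
print(f"Relative error reduction: {reduction:.1%}")
```

Under that assumption, the error rate falls from 14.5% to 9.2%, so phrase break detection removes a little over a third of the conversion errors on this corpus.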
Similar resources
Unlimited Vocabulary Grapheme to Phoneme Conversion for Korean
This paper describes a grapheme-to-phoneme conversion method using phoneme connectivity and CCV conversion rules. The method consists of four main modules: morpheme normalization, phrase-break detection, morpheme-to-phoneme conversion, and a phoneme connectivity check. Morpheme normalization replaces non-Korean symbols with standard Korean graphemes. The phrase-break detector ...
Unlimited Vocabulary Grapheme to Phoneme Conversion for Korean TTS
This paper describes a grapheme-to-phoneme conversion method using phoneme connectivity and CCV conversion rules. The method consists of four main modules: morpheme normalization, phrase-break detection, morpheme-to-phoneme conversion, and a phoneme connectivity check. Morpheme normalization replaces non-Korean symbols with standard Korean graphemes. The phrase-break detector a...
Statistical/Rule-based Hybrid Phrase Break
In this paper, we present a new phrase break detection architecture that integrates a probabilistic approach with rule-based error correction. The architecture consists of a probabilistic phrase break detector and a transformational rule-based post error corrector. The probabilistic method alone usually suffers from performance degradation due to inherent data sparseness problems. So we adopted ...
Hybrid Grapheme to Phoneme Conversion for Unlimited
Both dictionary-based and rule-based methods for grapheme-to-phoneme conversion have their own advantages and limitations. For example, the dictionary-based method requires a large phonetic dictionary and complex morphophonemic rules, and the LTS (letter-to-sound) rule-based method by itself cannot model the complete morphophonemic constraints. This paper describes a grapheme-to-phoneme...
Publication date: 1998